Top-Down Gaze Targeting for Space-Variant Active Vision
نویسنده
چکیده
The simultaneous need for a wide angle of eld and high resolution has led to the use of spatially variant sensors in active vision systems. The use of such sensors however necessitates the existence of gaze control mechanisms for guiding the foveal high resolution region of the sensor to points of interest in the visual world. While bottom-up alerting cues such as motion have previously been used for this purpose, tasks such as visual search are better facilitated by top-down guidance mechanisms. In this paper, we describe the use of iconic scene descriptions for top-down foveal targeting. These descriptions take the form of a vector of responses of a bank of steerable lters at multiple scales and orientations. Such a representation has a number of useful properties such as rotation and scale invariance, partial view-insensitivity and tolerance to occlusions. Top-down control of gaze for uniform resolution sensors is achieved by the process of backpro-jection which matches vectors of a previously foveated point to instances of the point in other possibly transformed images. The multiscale structure of representation can be exploited to extend this procedure to the space-variant case.
منابع مشابه
Gaze shift reflex in a humanoid active vision system
Full awareness of sensory surroundings requires active attentional and behavioral exploration. In visual animals, visual, auditory and tactile stimuli elicit gaze shifts (head and eye movements) to aid visual perception of stimuli. Such gaze shifts can either be top-down attention driven (e.g. visual search) or they can be reflex movements triggered by unexpected changes in the surroundings. He...
متن کاملIntegration of Static and Dynamic Scene Features Guiding Visual Attention
This paper presents a visual attention module driven by static and dynamic scene features controlling the gaze shifts of an active vision system. A preattentive processing unit computes several static features, like orientation and color, and a dynamic feature, motion. We distinguish two further processing modes of our active vision system: the hypothesis validation mode and the tracking mode. ...
متن کاملAppearance-Based Object Detection in Space-Variant Images: A Multi-model Approach
Recently, log-polar images have been successfully used in active-vision tasks such as vergence control or target tracking. However, while the role of foveal data has been exploited and is well known, that of periphery seems underestimated and not well understood. Nevertheless, peripheral information becomes crucial in detecting non-foveated objects or events. In this paper, a multiple-model app...
متن کاملControl of gaze while walking: Task structure, reward, and uncertainty
While it is universally acknowledged that both bottom up and top down factors contribute to allocation of gaze, we currently have limited understanding of how top-down factors determine gaze choices in the context of ongoing natural behavior. One purely top-down model by Sprague, Ballard, and Robinson (2007) suggests that natural behaviors can be understood in terms of simple component behavior...
متن کاملSaccadic Object Recognition with an Active
An active vision system for saccadic camera gaze shifts and explorative scene analysis as a new integral approach to image understanding is proposed. The model includes several subsystems: preattentive peripheral feature detection, multi resolution foveal image identiication based on a hypercolumnar representation and object recognition by means of two memories for foveal identiications and xat...
متن کامل